Speaker Accent Recognition by MFCC Using K- Nearest Neighbour Algorithm: A Different Approach
نویسنده
چکیده
A K-Nearest Neighbour Algorithm involving Mel-Frequency Cepstral Coefficients (MFCCs) is provided to perform Speech signal feature extraction for the task of speaker accent recognition. Mel-Frequency Cepstral Coefficient is effectively used to perform the feature extraction of the input signal. For each input signal the mean of the MFCC matrix is used for pattern recognition .The K-nearest neighbour algorithm is based on evaluating minimum Euclidean distance measure from input data set to stored data set. Since large number of speakers of different accent are present, they can be grouped together depending upon their accent .Thus each signal coming from different group makes a distinct MFCC vector .In this paper we have compared the MFCC from global group to smaller sub groups.
منابع مشابه
A Comparison of Classifiers in Performing Speaker Accent Recognition Using MFCCs
An algorithm involving Mel-Frequency Cepstral Coefficients (MFCCs) is provided to perform signal feature extraction for the task of speaker accent recognition. Then different classifiers are compared based on the MFCC feature. For each signal, the mean vector of MFCC matrix is used as an input vector for pattern recognition. A sample of 330 signals, containing 165 US voice and 165 non-US voice,...
متن کاملText-dependent speaker recognition by efficient capture of speaker dynamics in compressed time-frequency representations of speech
Prevalent speaker recognition methods use only spectralenvelope based features such as MFCC, ignoring the rich speaker identity information contained in the temporalspectral dynamics of the entire speech signal. We propose a new feature called compressed spectral dynamics or CSD for speaker recognition based on a compressed time-frequency representations of spoken passwords which effectively ca...
متن کاملتشخیص لهجه های زبان فارسی از روی سیگنال گفتار با استفاده از روش های استخراج ویژگی کارآمد و ترکیب طبقه بندها
Speech recognition has achieved great improvements recently. However, robustness is still one of the big problems, e.g. performance of recognition fluctuates sharply depending on the speaker, especially when the speaker has strong accent and difference Accents dramatically decrease the accuracy of an ASR system. In this paper we apply three new methods of feature extraction including Spectral C...
متن کاملFuzzy Nearest Prototype Classifier Applied to Speaker Identification
In a vector quantisation (VQ) based speaker identification system, a speaker model is created for each speaker from the training speech data by using the k-means clustering algorithm. For an unknown utterance analysed into a sequence of vectors, the nearest prototype classifier is used to identify speaker. To achieve the higher speaker identification accuracy, a fuzzy approach is proposed in th...
متن کاملBeating Henry Higgins at His Own Game: A Markovian Approach to Dialectology
1. Introduction The performance of speech recognition algorithms degrades considerably due to speaker variability. Aside from gender, the largest cause for speaker variability is accent. If the accent of a speaker can be determined automatically, then accent-specific speech recognition models can be used, thereby increasing speech recognition accuracy. In this study, the problem of accent class...
متن کامل